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Abstract 

Background: Primary care practices provide a gate-keeping function in many health care systems. Since depressive 
disorders are highly prevalent in primary care settings, reliable detection and diagnoses are a first step to enhance 
depression care for patients. Provider training is a self-evident approach to enhance detection, diagnoses and 
treatment options and might even lead to improved patient outcomes. 

Methods: A systematic literature search was conducted reviewing research studies providing training of general 
practitioners, published from 1999 until May 201 1, available on the electronic databases Medline, Web of Science, 
PsyclNFO and the Cochrane Library as well as national guidelines and health technology assessments (HTA). 

Results: 108 articles were fully assessed and 1 1 articles met the inclusion criteria and were included. Training of 
providers alone (even in a specific interventional method) did not result in improved patient outcomes. The 
additional implementation of guidelines and the use of more complex interventions in primary care yield a 
significant reduction in depressive symptomatology. The number of studies examining sole provider training is 
limited, and studies include different patient samples (new on-set cases vs. chronically depressed patients), which 
reduce comparability. 

Conclusions: This is the first overview of randomized controlled trials introducing GP training for depression care. 
Provider training by itself does not seem to improve depression care; however, if combined with additional 
guidelines implementation, results are promising for new-onset depression patient samples. Additional 
organizational structure changes in form of collaborative care models are more likely to show effects on 
depression care. 
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Background 

Depressive disorders are highly prevalent in the general 
public. The 12-month prevalence of Major Depression 
among Europeans has shown to be approximately 6.9% 
while conservative estimates of the lifetime prevalence 
range up to 14% [1,2]. Depression is associated with sig- 
nificant functional impairment and reduced quality of 
life [3,4], excess mortality rates [5] and particularly high 
costs for society and health care systems [6-9]. Consid- 
ering the large effects of the disease on individuals and 
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society, it seems clear that early detection and treatment 
is a desirable goal in order to promote remission and 
reduce negative consequences [10]. 

While 50 to 70 per cent of all depressed patients con- 
sult their primary care physician during an episode, 
therefore making them the profession most likely to be 
seen [11,12], depression in primary care remains under- 
recognised and under-treated [13]. As Bijl and collea- 
gues (2004) summarise, the elements of detection, diag- 
nosis and treatment determine successful depression 
management in health care. Previous trials showed that 
approximately 50 per cent of all depressed patients in 
primary care were not diagnosed as such [14-16]. This 
is a major downfall in depression care, since even 
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subthreshold depression episodes may be clinically sig- 
nificant [17]. Obviously part of this is due to reluctance 
to seek treatment among patients themselves, resulting 
from concerns on effectiveness of treatment and per- 
ceived absence of treatment necessity [18]. 

To objectify diagnoses, the use of screening instru- 
ments has been promoted by the U.S. Preventive Ser- 
vices Task Force, when adequate treatment and care 
possibilities are available [19]. Reviews showed that 
screening alone does not improve depression outcomes 
for patients [20], but needs further organizational 
changes [21]. These structural interventions, including 
collaborative care approaches as well as provider train- 
ing, represent an attempt to increase detection and diag- 
nosis of depressed patients and therefore promote 
enhanced treatment. Several treatment options that sup- 
port primary care physicians in treatment and that are 
also directly delivered by general practitioners (such as 
PST- problem solving therapy) have been found to be 
effective for depression [22,23]. 

Primary care practices play a central role in many 
health care systems- this kind of gate-keeping is even 
associated with improved coordination and outcomes 
[24]. This circumstance makes general practitioners 
ideal as a base for first steps in treatment, also referred 
to as a "stepped care" approach [25]. "Watchful waiting" 
and low intense interventions such as self-help 
approaches have to be encouraged as useful strategies 
[26]. In order to make full use of this opportunity, 
improvement of detection rates and diagnosis is inevita- 
ble. Improving skills of primary care providers can be 
achieved by different strategies. Consultation-liaison 
involves a persistent educational supervision of the gen- 
eral practitioner by a mental health specialist. This 
approach has not been shown to be effective in reducing 
depressive symptoms [27]. As indicated by Cape et al. 
(2010), another point of intervention can be to train pri- 
mary care providers in diagnosis and treatment strate- 
gies without the inclusion of mental health specialists 
(such as collaborative care), considering limited financial 
resources of health care systems [27]. Moderated by 
higher detection rates and better knowledge on treat- 
ment options, improvement on this level could subse- 
quently lead to higher remission rates in less time and 
improved depression outcomes. 

In the past, these programmes of provider education 
have been evaluated, yielding unclear results on effec- 
tiveness of the intervention regarding health gain out- 
comes [21]. This study therefore reviews current 
literature for an updated overview. It is the first over- 
view of randomized controlled trials that exclusively 
investigates interventions that apply practitioner 
training. 



Methods 

Literature Search 

This review was prepared according to the systematic 
literature review guidelines of the Centre for Reviews 
and Dissemination [28] and follows PRISMA (Preferred 
Reporting Items for Systematic reviews and Meta- Ana- 
lyses) suggestions [29]. A systematic literature search 
was conducted reviewing research studies, published 
from 1999 until May 2011, available on the electronic 
databases Medline, Web of Science, PsycINFO and the 
Cochrane Library as well as national guidelines and 
health technology assessments (HTA). In addition, the 
bibliographies of the selected articles were searched. 
Grey literature was not searched. 1999 as a starting 
point of the search was chosen to include at least the 
last 10 years of publications. The latest review on this 
topic, including studies from 1999 onward, was con- 
ducted in 2003, and we meant to include those studies 
as well [21]. The aim was to evaluate if newer, more 
recent studies would show clearer effects of the inter- 
vention than previous overviews. 

The terms (depression OR depressive disorder) AND 
(general practitioner OR general practice OR primary 
care OR family practice) AND (training OR education) 
served as search criteria within titles and abstracts. All 
terms were also used as MeSH terms where applicable. 
Test searches were run preceding the actual search in 
order to determine the right search terms. Additional 
File 1 shows the Medline search strategy in detail. The 
search was limited to English and German language 
publications. 

Inclusion criteria 

Abstracts were screened by two authors using the fol- 
lowing inclusion criteria: (i) randomized controlled trials 
(RCT) or review articles (ii) of the adult (> 18 years) 
general population, (iii) evaluating interventional pro- 
grammes including general practitioner training, men- 
tioned in the abstract and (iv) reporting effects on 
depressive symptomatology. All extracted review articles 
were scanned and hand-searched for further relevant 
publications from 1999 onward. 

Studies examining effects in specific study samples 
(such as diabetic patients with co-morbid depression) 
were excluded. Research of those specific samples was 
thought to provide only limited evidence for primary 
health care patients in general. All articles where a clear 
decision could not be made based on title or abstract 
were retrieved for a more detailed analysis. In case of 
disagreement, a third reviewer was consulted and then a 
consensus decision was achieved. 

Training and education of general practitioners was 
defined as a professional intervention [21] that involves 
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the use of guidelines or short training classes with a 
focus on optimising diagnosis as well as treatment. Stu- 
dies involving additional organisational interventions 
were excluded. Additional file 2 gives an overview on 
excluded studies. 

Data extraction 

Primarily, methodical data on sampling, study design, 
intervention procedure, and outcome criteria were 
extracted from all selected studies. Extraction of results 
focussed on assessing symptom alteration primarily. 
Only data related to a change in symptom severity 
(scale scores, remission rates) were extracted. Effect 
sizes were only calculated for the outcomes considered 
as relevant (symptom change). Secondly, the selection 
criteria described in the above section were then reap- 
plied to ensure accurate study inclusion. 

Study Quality 

Study quality was also assessed by two independent 
raters using a modified scale based on work by Mon- 
crieff and colleagues [30]. The scale was modified by 
leaving out irrelevant items such as medication side- 
effects. It consists of 21 items leading to a maximum 
score of 42 points (Table 1). Each item, if not specified 
otherwise, was scored as 2 (fully met the quality criter- 
ion), 1 (partially met the quality criterion) or as 0 (did 
not meet the quality criterion). After a first independent 
run-through of ratings, the two raters met with a third 
independent researcher in order to discuss disagree- 
ments in scoring until a consensus was reached. Study 
protocols were consulted where possible. 

Effect Size and meta-analysis 

Whenever applicable, standardized mean effect sizes 
(Cohen's d) were calculated from the data reported. At 
times, studies only reported scores that could not be 
used in effect size calculation and efforts to retrieve data 
directly from the authors were made. Data was entered 
to interpret negative standardized means in favour of 
the intervention. Results of cluster trials were used 
when the authors accounted for the effect of cluster ran- 
domization properly. According to Cohen (1988), effect 
sizes of 0.2 are considered small, while d = 0.5 repre- 
sents a moderate effect and d = 0.8 is regarded a large 
effect [31]. A meta-analysis was conducted using Review 
Manager Software [32]. Due to the diversity of GP train- 
ing in the studies, standardized mean effects were 
pooled - firstly, for studies with GP training only, sec- 
ondly, for studies introducing additional guidelines and 
lastly for studies including more complex interventions. 
Subgroup analysis (patient inclusion, age of patients) 
were not carried out due to the unavailability of suffi- 
cient data. Analysis of the heterogeneity of prevalence 



across studies was done through I 2 statistic and Cochran 
Q. A fixed effect model was applied since heterogeneity 
was low. 

Results 

Search results 

The results of the systematic literature search are shown 
in Figure 1. Interrater reliability showed substantial 
agreement (Kappa = 0.74). Overall, 108 potentially rele- 
vant articles were identified. After retrieving all full arti- 
cles, 97 further articles were rejected as not fulfilling the 
selection criteria. Eleven articles were assessed and 
included for detailed analysis. Relevant study character- 
istics can be found in Table 2. Three articles are double 
publications of the same studies and will be subtracted 
for the following overview. The QuEST intervention is 
described in detail in a publication by Rost et al. (2000). 
Therefore, this reference will be used when referring to 
that study. 

Study characteristics 

All, but one, studies were conducted in anglophone 
countries, among them Great Britain (n = 4) and one 
study each in Canada, the United States and Australia. 
The sample sizes varied from 145 [33] to 733 [34,35]. 
Three studies included patients with a categorical (e.g. 
diagnostic system based) diagnosis of depression 
[33,36,37], while the other five made use of symptom 
rating scales (self-report scales). Gask et al. (2004) and 
Worrall et al. (1999) both based their samples on refer- 
rals by the general practitioner (having the GP deter- 
mine whether the patient was depressed) but applied 
dimensional instruments to ensure accuracy of diag- 
noses [38,39]. All but one study used a cluster allocation 
design, randomising all included general practices to 
either intervention or control group. Only Llewellyn- 
Jones et al. (1999) conducted a serial designed survey, 
randomising each consecutive patient to the experimen- 
tal groups [40]. Four research groups planned to train 
the control group practitioners after the end of the trial 
while the other four had them assigned to usual care 
groups with no further support provided. However, 
three study teams chose to provide the physicians of the 
control groups with depression specific guidelines 
[36,39,40]. In the Dutch study it is highlighted, that all 
practitioners are generally encouraged to adhere to 
guideline concordant treatment [33]. 

Interventions 

As for interventional strategies, education and training 
of the participating general practitioners was the sole 
intervention in three studies [33,38,41]. These studies 
did not provide any other organisational support for the 
practices. King et al. (2002) pose an exception to the 
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Table 1 Quality assessment (Based on Moncrieff et al., 2001) 



Criterion 


Score and rating criteria 


(1) Objectives and specification 


0 = objectives unclear 


main outcomes a priori 


1 = objectives clear but main outcomes not specified a priori 




2 = objectives clear with a priori specification of main method 




for assessment of outcome 


(2) Adequate sample size (n per group) 


0 = inadequate (< 50/group) 




1 = moderate (50-1 00/g roup) 




z — idiye \> iuu/yioup oi jusuneu uy powei LdiLuidiioi isj 


(3) Appropriate duration of trial including follow up 


0 = too short (< 3 months) 




1 = reasonable length (3-6 months) 




2 = long enough for assessment of long term outcomes (6-12 months) 


(4) Power calculation 


0 = not reported 




i — iiiciiuoiieu wiuioui ueidiis 




2 = details of calculations provided 


(5) Method of allocation 


0 = unrandomized and likely to be biased 




1 = partially or quasi randomized with some bias possible 




2 = randomized allocation 


(6) Concealment of allocation 


0 = not done or not reported 




2 = concealment of allocation code detailed 


(7) Clear description of treatments (including doses of drugs used) and 


0 = main treatments not clearly described 


adjunctive treatments 


1 = inadequate details of main or adjunctive treatments 




2 = full details of main and adjunctive treatments 


(8) Blinding of subjects 


0 = not done 




1 = done but no test of blind 




2 = done and integrity of blind tested 


(9) Source of subjects described and representative sample recruitment 


0 = source of subjects not described 




1 = source of subjects given but no information on sampling or use 




of unrepresentative sample (for example, volunteers) 




2 = source of subjects described plus representative sample taken 




(for example, all consecutive admissions or referrals, or random 




sample taken) 


(10) Use of diagnostic criteria (or clear specification of inclusion criteria) 


0 = none 




1 = diagnostic criteria or clear inclusion criteria 




2 = diagnostic criteria plus specification of severity 


(11) Record of exclusion criteria and number of exclusions and refusals 


0 = criteria and number not reported 


reported 


1 = criteria or number of exclusions and refusals not reported 




2 = criteria and number of exclusions and refusals reported 


(12) Description of sample demographics 


0 = little/no information (only age/sex) 




1 = basic details (for example, marital status/ethnicity) 




2 = full description (for example, socioeconomic status, 




clinical history) 


(13) Blinding of assessor 


0 = not done 




1 = done but no test of blind 




I I • . • , r i i ■ i , , i 

2 = done and integrity of blind tested 


(14) Record of number and reasons for withdrawal by group 


0 = no info on withdrawals by group 




1 = withdrawals by group reported without reason 




2 = withdrawals and reason by group 


(15) Outcome measures described clearly (and therefore replicable) or use 


0 = main outcomes not described clearly 


of validated (or referenced) instruments 


1 = some of main outcomes not clearly described 2 = main outcomes 




clearly described or valid and reliable 




instruments used 


(16) Information on comparability and adjustment 


0 = no information on comparability 


for differences in analysis 


1 = some information on comparability with appropriate 




adjustment 




2 = sufficient information on comparability with appropriate 




adjustment 


(17) Inclusion of all subjects in analyses (Intention to treat analysis) 


0 = no 




2 = yes 


(18) Presentation of results with inclusion of data forre-analysis of main 


0 = little information presented 


outcomes (for example, SDs) 


1 = adequate information 




2 = comprehensive 
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Table 1 Quality assessment (Based on Moncrieff et al., 2001) (Continued) 



(19) Appropriate statistical analysis (including correction for multiple tests 


0 


= inadequate 


where applicable) 


1 


= adequate 




2 


= comprehensive and appropriate 


(20) Conclusions justified 


0 


= no 




1 


= partially 




2 


= yes 


(21) Declaration of interests (for example, 0 = no 


0 


= no 


source of funding) 


2 


= yes 



other studies, since physicians here are trained to pro- 
vide a specific interventional method (brief cognitive 
behavioural therapy) [41], while in the remaining trial 
physicians were only provided with lectures on 



assessment and treatment of depression. Four studies 
made use of guideline implementation [35-37,39]. These 
studies can be seen as providing a more intense inter- 
vention, as practitioners were trained and additionally 



c 
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Records identified through database search 



PubMed 
(n = 493) 



Web of 
Science 
(n = 1218) 



EMBASE 
(n = 1468) 



Records screened after duplication deletion 
(n = 3595) 



Additionally found 
in reference lists 
(n= 6) 



Full-text articles assessed 
for eligibility 
(n = 108) 



Studies included in 
qualitative synthesis 
(n = ll) 



Studies included in 
quantative synthesis 
(n = 6) 




Records excluded 
(n = 3496) 



Full-text articles excluded, 
with reasons 
(n = 97) 
71 reviews/metaanalyses 
8 GP training not part of 
the intervention 
7 no relevant outcome 
criteria 

4 collaborative care inter- 
vention (too complex) 
3 no RCT 

2 special sample (high 
utilizers) 

1 no focus on depression 
care 

1 no untrained control 
group 



Figure 1 Search Strategy. 
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Table 2 Study Characteristics 


Study 


Country 


Recruitment 


Randomization 


N a 


Intervention, Role of GP Training 


Control 


Comparison 


Quality 






Inclusion Criteria 








Group 






Baker 


GB 


Consecutive patients; 


Practices 


402 


Tailored intervention to promote 


UC b 


Patients of 


30 


(2001) [36] 




> 1 8 yrs 






guideline implementation 




experimental group 








Patients seeking 






(additional feedback, educational 




vs. control group 








consultation for new- 






visits, group discussions) 












onset depression 






Additional to guideline 








Bosnians 


NL 


Consecutive patients; 


Practices 


145 


4 hrs training session on screening, 


uc 


Patients of 


39 


(2006) [33] 




> 55 yrs 






diagnosis and treatment as in 




experimental group 








PRIME-MD = MD 






Dutch guidelines 




vs. control group 




Gask (2004) 


GB 


GP referral; 


Practice 


189 


Acquisition of clinical skills, 5 


WL 


Patients of 


36 


[38] 




16-65 yrs 






lectures a 2 hrs on assessment and 




experimental group 








Intention or current 






treatment; 




vs. control group 








Li call I Icl 1 1 Ul 






OUIc IIILcI VciRIUII 












depression 


















(symptoms < 6 mo) 


















HAM-D > 13 














ixfcri 1U I ILK 


GB 


C nncori iti\/o i"\ ratio ntc - 
L.UI IbcLU LI Vc pdLlcllLb, 


r IdLLILc 


/ jj 


vjUIUcIIIIc 1 1 1 1 picl I Icl I Ld LIUI I (y. ur 


WL 


rdLlcl lib Ul 


JO 


(2001) [34] 




> 1 6 yrs 






training (4 h seminars, educators 




experimental group 




Thompson 




HADS-D > 8 






available for 9 more mo); 




vs. control group 




(2000) [35] 










Additional to guideline 




Sensitivity of 


















recognition rates of 


















experimental group 


















vs. control group 




King (2002) 


GB 


Consecutive patients; 


Practice 


272 


Training of GPs in brief cognitive 


WL 


Patients of 


34 


[41] 




> 1 8 yrs 






behaviour therapy (4 half day 




experimental group 








HADS-D/A > 1 1 






workshops); 




vs. control group 














Sole intervention 








Llewellyn- 


AUS 


Residential facility; 


Patient 


220 


Shared Care Intervention, including 


WL C 


Experimental group 


33 


Jones 




> 65 yrs 






ur Lldllllliy (y. cUULdLIUII, Ileal II I 




vb. luiiliui yiuup 




(1999) 




GDS > 10 






education and promotion, 








[40] 




MMSE > 18 






psychoeducation; 


















Central part of complex 


















intervention 








Rost (2001) 


USA 


GP referral; 


Practice 


479 


QuEST intervention, 4 academic 


UC 


Patients of 


38 


[44] 




DSM lll-R MD (latter 






telephone calls to implement 




experimental group 




Pyne 




two studies exclude 






guidelines, nurse w/8-hour face-to- 




vs. control group d 




(2003) [43] 




patients currently in 






face training; 








Rost (2005) 




treatment) 






Guidelines implementation 








[45] 


















Worrall 


CAN 


GP referral; 


Practice 


147 


3-hour sessions on guideline 


UC b 


Patients of 


26 


(1999) [39] 




GP diagnosis and 






implementation + possible 




experimental group 








severity rating, later 






consultation of psychiatrist- 




vs. control group 








CES-D > 16 






Guidelines implementation 









a At baseline; 6 Receipt of guidelines by mail; c Assessment of control group during first period of study, followed by intervention and assessment of experimental 
group; d Rost et al. (2001): comparison recently treated patients vs. patients beginning new treatment episode, analysis for both intervention groups. 



Abbreviations: CES-D - Center for Epidemiologic Studies Depression Scale; CIDI - Composite International Diagnostic Interview; GDS - Geriatric Depression Scale; 
GP - General Practitioner; HADS-D/A - Hospital Anxiety and Depression Scale; HAM-D - Hamilton Depression Scale; MD - Major Depression; MMSE - Mini Mental 
Status Examination; mo - month(s); PRIME-MD - Primary care Evaluation of Mental Disorders; Ql - Quality Improvement; UC - Usual Care; w/- with; WL - Waiting 
List; yrs - years. 



received guidelines and guideline explanations. Rost et 
al (2000) and Worrall et al. (1999) focussed the GP 
training on implementing guidelines [37,39]. Thompson 
et al. (2000) also educated practitioners but additionally 
tried to implement guideline concordant treatment [35]. 
Baker et al. (2001) used a tailored application of practi- 
tioner training by firstly analysing possible obstacles for 
successful guidelines implementation and then deliver- 
ing individualised help to the GPs [36]. 



Regarding more complex interventions, it can be con- 
cluded that provider education plays the central part in 
the programme conducted by Llewellyn-Jones et al. 
(1999). 

The mean quality score of all studies was at 34.91 
points and ranged from 26 to 39 (individual scores in 
table 2). Criteria such as random allocation as proposed 
by the Cochrane Collaboration Handbook were ade- 
quately addressed by all studies [42]. 
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Effectiveness of provider training 

Table 3 summarises all study results. The three studies 
solely providing physician education found no change in 
symptom severity. Neither lectures for more qualified 
assessment and treatment [33,38], nor training in brief 
cognitive behavioural therapy [41] led to significant 
symptom change in patients of trained physicians. Intro- 
ducing additional guidelines and using them during 
practitioner training, two studies showed a mid and 
long term significant change in symptom load 
[39,43-45]. Both trials report small effect sizes (d = 0.22- 



0.29). Short term, one study was able to show an 
increases probability of reducing the depression score 
below a clinically relevant cut-point [36]. Yet, another 
study fails to show effects of the intervention introdu- 
cing guidelines. Not only was there no effect of the 
practitioner training on diagnosis sensitivity and specifi- 
city, patient also do not profit symptom wise or regard- 
ing hospital admittance [34,35]. 

As for the more complex interventional strategies, 
there is one study in which general practitioner and pro- 
vider training plays a central role [40]. In a sample of 



Table 3 Study Results 



Study 


Follow 
Up 


Attrition 
Rate % a 


Outcome 


Results 


Limitations 


Effect 
Size b 


Baker 
(2001) [36] 


16 

weeks 


6 


Proportion of 
patients w/ 
BDI < 11 


Sign, diff in proportion of patients w/BDI > 
11 (OR = 2.5) 


Tailored intervention that makes GP 
comparison impossible since they all 
received diff kinds of intervention 


/ 


Bosnians 
(2006) [33] 


12 mo 


21 


PRIME-MD 


No sign, diff in MD recovery 


Possible Hawthorne effect, less severe 
episodes of MD in primary care, no 
blinding of patients 


-0.07 


Gask (2004) 
[38] 


3, 12 
mo 


37 


HAM-D 


No sign, diff in scores at both follow up 
points 


Use of a new-onset (depressed less than 6 
mo) sample 

Attrition rate rather high 


-0.24 


Kendrick 
(2001) [34] 


12 mo 


19 


Hospital 
Admittance 


No sign, difference 


Implemented guidelines had been tested 
in highly selected samples 
Dimensional diagnosis 
Potential conservative bias (chronic 
depressed patients) 


/ 


Thompson 
(2000) [35] 


6 

weeks/ 
6 mo 


50 


HAD 


No diff in improvement, no diff in caseness 
rating at both points 
Only patients recognized as cases at 
baseline improve sign, during first 6 weeks 
(p = 0.044), no diff at 6 mo 


See Kendrick (2001) 


/ 








Diagnosis 
sensitivity 


No diff in sensitivity nor specificity 


See Kendrick (2001) 


/ 


King (2002) 
[41] 


6 mo 


10 


BDI 


No sign, diff in BDI scores (p = 0.84) 


Smaller sample than anticipated 
Cut off score for inclusion rather high 
(makes intervention effect of CBT by lay 
GPs less likely) 


0.08 


Llewellyn- 
Jones 
(1999) 
[40] 


9.5 mo 


23 


GDS 


Sign, change in GDS scores 


Serial mono-centered design 
Control group assessment before 
implementation of intervention 


-0.17 


Rost (2001) 
[44] 


6 mo 


10 


mCES-D 


Sign, reduction in score in patients 
beginning new treatment episode 


GP training effect unclear, feedback of 
diagnosis could be responsible for 
treatment effect 
Homogenous sample 


-0.29 


Pyne 

(2003) [43] 


12 mo 


65 


Depression 
severity 


Sign, decrease (7.7 units) in experimental 
group 


See Rost (2001) 


/ 


Rost (2005) 
[45] 


24 mo 


70 


Depression 
Free Days 


Sign, more depression free days in 
experimental group (647.6 vs. 558.2) 


See Rost (2001) 


/ 


Worrall 

(1999) [39] 


6 mo 


? 


Gain score 
CES-D 


Sign, improvement in experimental group 


Possible Hawthorne effect 


-0.22 



a all groups, last follow up; b calculated as the mean difference of experimental and control group at the latest post-treatment measurement; 0 adopted from 
Bower et al. (2006) [45]. 

Abbreviations: BDI - Beck's Depression Inventory; CBT - Cognitive Behavioral Therapy; CE Ratio - Cost Effectiveness Ratio; mCES-D - (modified) Center for 
Epidemiologic Studies Depression Scale; CIDI - Composite International Diagnostic Interview; diff - difference; GDS - Geriatric Depression Scale; GP - General 
Practitioner; HAD - Hospital Anxiety and Depression Scale; HAM-D Hamilton Depression Scale; mo - month(s); MD - Major Depression; PRIME-MD - Primary care 
Evaluation of Mental Disorders; QALY - Quality of Adjusted Life Year; SF12 - Short Form 12-ltem Health Survey; sign - significant 
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elderly (65+) adults in a residential facility, Llewellyn- 
Jones and colleagues (1999) show a significant change in 
symptom quantity associated with a small effect size of 
Cohens d = 0.17. 

In studies that provided guidelines for the non-trained 
control group practitioners [36,39,40], additional train- 
ing and interventions in the experimental groups led to 
positive outcome changes (see tables 2 and 3). 

Meta-analysis 

The forest-plot of the conducted meta-analysis can be 
found in Figure 2. Three studies with only provider 
training provided data for meta-analysis. They found a 
non-significant decrease in depressive symptom load 
(pooled effect size d=-0.07, 95% CI -0.24 to 0.10, I 2 = 
21%). 

Two studies were categorized as implementing addi- 
tional guidelines to primary care. They showed the high- 
est pooled effect sizes and a significant decrease in 
depressive symptoms in the intervention groups (d=- 
0.26, 95% CI -0.48 to -0.04). The overall effect size, 
including a study with a more complex intervention was 
determined at d=-0.15 (95% CI -0.27 to -0.03]), favour- 
ing the experimental groups. 

Discussion 

It is apparent that there are only few trials evaluating 
the effect of primary care physician education on health 
outcomes of patients. Especially during the last six years 
no results of education based interventions have been 



published. It seems that the research focus has shifted 
to more complex interventions encompassing collabora- 
tive and stepped care approaches by introducing mental 
health specialists to the primary care setting [46-48]. 
Regarding the results of the reviewed studies, this 
approach seems more than justified - it has yet to be 
shown that training practitioners alone yields significant 
symptom changes; however, this conclusion is only 
based on three relevant studies that themselves are 
highly diverse. While the study by Gask et al. (2004) 
struggles with high attrition rates, King et al. (2002) 
used a rather high cut-off and included chronically 
depressed patients, possibly leading to a conservative 
bias and therefore underestimating the treatment effect 
[38,41]. The authors argue that the applied kind of brief 
cognitive behavioural therapy might have been treat- 
ment approach not sufficiently intense for highly 
depressed patients. Bosmans et al. (2006) find that 
including less severely affected patients might have led 
to an underestimation of the efficacy of anti-depressant 
treatment [33]. 

Sample selection plays a major role in assessing treat- 
ment effects in general. One could assume that severely 
affected patients benefit more from treatment in clinical 
studies (as they can show a higher reduction in quantity 
of symptoms). In line with this, a categorical diagnostic 
approach for patient inclusion by applying diagnostic 
categories as provided by the Diagnostic and Statistical 
Manual of Mental Disorders (DSM) might lead to a 
sample of more severely affected patients [33,36,37]. 



Experimental Control 



Sti h I y m S< il hj i n i 1|> I. Meai i Diff ei ei ice 



SE 



Total Total Wehjht 



Std. Mean Difference 
IV, Fixed, 95% CI 



Std. Mean Difference 
IV, Fixed, 95% CI 



1.1.1 Only GP training 



Bosnians 2006 


-0.07 


0.1794 


58 


67 


12.2% 


-0.07 


[-0.42, 0.28] 


Gask 2004 


-0.24 


0.1461 


97 


92 


18.4% 


-0.24 


;-0.53, 0.05] 


King 2002 


0.08 


0.1384 


104 


105 


20.5% 


0.08 


;-0.19, 0.35] 


Subtotal (95% CI) 






259 




51.0% 


-0.07 [-0.24, 0.10] 


Heterogeneity: Chi== 2.53, df = 2 (P = 


0.28); l= 


= 21% 












Test for overall effect: Z = 0.81 (P = 0 


42) 
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Rost 2001 


-0.29 


0.1463 


97 


92 


18.3% 


-0.29 [ 


0.58, -0.00] 


Worrall 1999 


-0.22 


0.1703 


91 


56 


13.5% 


-0.22 


[-0.55, 0.11] 


Subtotal (95% CI) 






18© 


m 


31.8% 


-0.26 [- 


0.48. -0.04] 


Heterogeneity: Chi== 0.1 0, df = 1 (P = 


0.76); l= 


= 0% 












Test for overall effect: Z = 2.35 (P = 0 


02) 
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-0.17 


0.1513 


86 
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17.1% 


-0.17 


[-0.47, 0.13] 
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83 


17.1% 
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Heterogeneity: Not applicable 
















Test for overall effect: Z = 1 .1 2 (P = 0 
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100.0% 


-0.15 [- 


0.27, -0.03] 


Heterogeneity: Chi== 4.44, df = 5 (P = 


0.49); l= 
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Test for overall effect: Z = 2.37 (P = 0 


02) 














Test for subgroup differences: Chi = = 1 .82. df = 


2(P = 0.40), l= = 


0% 
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Figure 2 Forest-Plot. 
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Furthermore, the kind of treatment has an effect. In 
the context of stepped care, this issue is addressed by 
providing more-intense treatment options to higher 
affected patients [25]. 

Both argumentations can provide explanations for the 
positive results found by studies implementing additional 
guideline usage by general practitioners. Small effect 
sizes were shown by those studies including patients with 
new-onset depression, rather than chronically depressed 
patients (as done by Kendrick et al., 2001 not resulting in 
positive symptom change). Obviously, the effect of mere 
attention to the trained practitioners as well as to the 
patients themselves (referred to as Hawthorne effect) has 
to be considered a possible moderating variable in study 
designs. This would lead to better outcomes and perfor- 
mances of the control groups even though they received 
no active intervention; thus, the differences found may 
possibly be even higher. 

The justification for more complex interventions to 
improve primary care depression treatment is replicated in 
the analysis of included studies and basically goes in line 
with a previous review [21], however, we did find more 
evidence in newer studies that support guideline imple- 
mentation to be effective in symptom reduction. One trial 
applying more complex strategies both yielded significant 
changes in symptom outcomes; however, it remains 
unclear how much of the effect can be attributed to the 
physicians' education. Bower et al. (2006) conducted a 
meta-regression to evaluate active ingredients of collabora- 
tive care interventions [49] . In this analysis, primary care 
physician training is not associated with a positive change 
in depressive symptoms nor with a change in antidepres- 
sant use even in univariate calculations. Rather than provi- 
der education, systematic identification of patients, 
professional background of staff and supervision proved to 
be significant predictors of symptom change. It becomes 
clear that researchers should not assume an additive effect 
of treatment modules; especially in view of economic con- 
siderations, collaborative care cannot mean implementing 
as many treatment options as possible. This analysis of 
one specific potentially effective part of collaborative care 
is leading the way to a more thorough understanding of 
complex interventions and has to be pursued without 
neglecting the fact that more simple interventions can also 
lead to significant changes in patient outcomes as shown 
in this review. 

However, it may not be appropriate to solely focus on 
outcomes of symptomatology as enhanced primary care 
supply may not be directly associated with such. Even 
the included studies show a rise in adherence to medi- 
cation treatment [39] and medication treatment in gen- 
eral [34,44,50]. It has been shown that effectiveness of 
antidepressant treatment increases with depression 



severity [51]; an effect of increased antidepressant treat- 
ment in a sample of mildly depressed patients will 
therefore be small [as seen in the studies by [33,39]]. 

Strengths and Limitations 

This review only included randomised controlled trials, 
and therefore neglected observational and non-rando- 
mised studies. RCTs often adhere to strict exclusion cri- 
teria, thus making generalisability to primary care patients 
difficult. This also applies to the current review since stu- 
dies with specialised co-morbid patient groups were 
excluded; however, regarding the heterogeneity of primary 
care patients, an adequate representation of studies seems 
hard to achieve in any case. The reported studies differ 
substantially in content, duration, intensity and frequency 
of the intervention programmes, making comparisons dif- 
ficult. However, we were able to conduct a meta-analysis, 
quantifying the results of the studies. Meta-regression that 
could help determine the influence of these factors was 
not applicable due to the limited number of studies. 

This partly results from the relatively narrow search 
strategy; only when education or training efforts of GPs 
were mentioned within title and abstract, the article was 
found with the applied search strategy. Earlier publica- 
tions (before 1999) were not searched. Gilbody et al. 
mention one previous trial that showed positive effects 
of provider training but equally emphasise methodologi- 
cal weakness of this trial [21,52], so we did include rele- 
vant trials that live up to methodological requirements.". 

Furthermore, a possible publication bias cannot be 
ruled out or determined with a qualitative review as 
this, especially under the regard of not searching grey 
literature. Regarding the fact that almost only studies 
from anglophone countries were found might be an 
indicator for unpublished studies with negative out- 
comes from other countries. 

Conclusions 

It seems that provider training, if combined with guide- 
line implementation, contributes to enhanced care for 
depression in primary care even associated with possible 
positive symptom changes. Providing a guideline and 
training practitioners to adhere to guideline-concordant 
treatment might be a measure of intervention that 
endures even after the intervention ends. 
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